PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID KHN00829.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family MYB
Protein Properties Length: 1690aa    MW: 184240 Da    PI: 5.6074
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
KHN00829.1genomeTCUHKView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding285.1e-09784825346
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      +WT+eE e +++ ++ +G++ +++Ia+ +  ++t+ +c+++++k
       KHN00829.1 784 PWTPEEREVFLEKFAAFGKD-FRKIASFLD-HKTAADCVEFYYK 825
                      8*****************99.*********.***********98 PP

2Myb_DNA-binding33.68.8e-119991038344
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHH CS
  Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrw 44  
                        WT +E   +++av  +G++ +++Iar++g +R+ +qck ++
       KHN00829.1  999 DWTDDEKTAFLQAVSSFGKD-FAKIARCVG-TRSQEQCKVFF 1038
                       5*****************99.*********.********766 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466893.59E-14768828IPR009057Homeodomain-like
PROSITE profilePS5129316.282780831IPR017884SANT domain
SMARTSM007171.4E-9781829IPR001005SANT/Myb domain
PfamPF002491.1E-6783825IPR001005SANT/Myb domain
CDDcd001671.63E-7784826No hitNo description
Gene3DG3DSA:1.10.10.605.0E-6784825IPR009057Homeodomain-like
PROSITE profilePS5129312.839951046IPR017884SANT domain
SMARTSM007172.2E-89961044IPR001005SANT/Myb domain
SuperFamilySSF466896.63E-109971046IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.602.5E-69991038IPR009057Homeodomain-like
PfamPF002496.0E-99991038IPR001005SANT/Myb domain
CDDcd001671.12E-710001038No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1690 aa     Download sequence    Send to blast
MPPEPLPWDR KDFFKERKHE RSESLGSVAR WRDSSHHRDF NRWGSAEFRR PLGHGKQGGW  60
HLFSEESGHG YAISRSSSDK MLEDDSRPSF SRGDGKYGRS SRENRGGPFG QRDWRGHSWE  120
PSNGSISFPR RQQDVNNDHR SIDDALAYSP HPHSDFGNAW DQHHLKDQHD KMGGVNDFGA  180
GPRCDRENSL GDWKPLKWTR SGSLSSRGSG FSHSSSSRSM GGADSHEAKA ELLPKSVAVN  240
ESHSGEAAAC ATSSVPSEDT TSRKKPRLGW GEGLAKYEKK KVEVPEASAN KDGPVLSTSN  300
TEPCNLLSPS LVDKSPKVIG FSECASPATP SSVACSSSPG MDDKLFGKTA NVDNDVSNLT  360
GSPAPVSENH FARFSFNLEK FDIDSLNNLG SSIIELVQSD DPTSLDSGPM RSNSINKLLI  420
WKADISKVLE MTESEIDLLE NELKSLKSES GETCPCSCPV ALGSQMVGGD EKYGEEHVGV  480
SDQVIRPLPL KVVDDPNTEK MPLSTNLHSI HENGKEEDID SPGTATSKFV EPLPLIKAVS  540
CDTRGYDNFS RDLDAVQSTA VKCLVPCTTR KEASVSTFVD GNTSMALKDS MDILYKTIIS  600
SNKESANRAS EVFDKLLPKD CCKIEKMEAS SDTCTHTFIM EKFAEKKRFA RFKERVIALK  660
FRALHHLWKE DMRLLSIRKC RPKSHKKNEL SVRSTCNGIQ KNRLSIRSRF PFPGNQLSLV  720
PTSEIINFTS KLLSESQVKV QSNTLKMPAL ILDEKEKMIS KFVSSNGLVE DPLAIEKERA  780
MINPWTPEER EVFLEKFAAF GKDFRKIASF LDHKTAADCV EFYYKNHKSD CFEKIKKQDG  840
CKLGKSYSAK TDLIASGKKW NRELSASSLD ILSAASLMAD GIAGNKKLRT GSSLLGGYGK  900
VKTSRGEDFI EKSSSFDILG DERETAAAAD VLAGICGSLS SEAMSSCITS SVDPVEGNRD  960
RKFLKVNPLC KPPMTPDVTQ DVDDETCSDE SCGEMDPTDW TDDEKTAFLQ AVSSFGKDFA  1020
KIARCVGTRS QEQCKVFFSK GRKCLGLDLM RPIPENVGSP VNDDANGGES DTDDACVVET  1080
GSVVGTDKSG TKTDEDLPLY GTNTYHDESH PVEARNLSAE LNESKEIIGT EVDLEDANVT  1140
SGAYQINIDS EQGCDGSEVF LCVSNKSGSV GEQAGIIMSD STEVGKDKAN KLGGAATELI  1200
SAPDSSEPCE SNSVAEDRMV VSEVSSGGLG NELERYRVSA TLCVDDRDNK YEADSGVIVD  1260
LKSSVHDLST MVNSSLSSLG TSCSGLSFCS ENKHVPLGKP HVSALSMDDL LATSNSLLQN  1320
TVAVDVQCEK TASQDQMSST CDIQGGRDMH CQNSISNAGH QLPITGNLSD HVDAVSILQG  1380
YPFQVPLKKE MNGDMNCSSS ATELPFLPHK IEQDDDHIKT FQSSDSDKTS RNGDVKLFGK  1440
ILTNPSTTQK PNVGAKGSEE NGTHHPKLSS KSSNLKFTGH HSADGNLKIL KFDHNDYVGL  1500
ENVLENVPMR SYGYWDGNRI QTGLSTLPDS AILLAKYPAA FSNYPTSSAK LEQPSLQTYS  1560
KNNERLLNGA PTLTTTRDIN GSNAVIDYQV FRRDGPKVQP FMVDVKHCQD VFSEMQRRNG  1620
FEAISSLQQQ SRGVMGMNGV GRPGILVGGS CSGVSDPVAA IKMHYSNSDK YGGQTGSIAR  1680
EDESWGGKGD
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_D5e-17747833994NUCLEAR RECEPTOR COREPRESSOR 2
4a69_C5e-17747833994NUCLEAR RECEPTOR COREPRESSOR 2
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006606233.10.0PREDICTED: uncharacterized protein LOC100810588 isoform X3
RefseqXP_003556223.20.0PREDICTED: uncharacterized protein LOC100810588 isoform X2
TrEMBLA0A0B2NZZ30.0A0A0B2NZZ3_GLYSO; Nuclear receptor corepressor 1
STRINGGLYMA20G31871.10.0(Glycine max)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF49863352
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.10.0MYB family protein
Publications ? help Back to Top
  1. Qi X, et al.
    Identification of a novel salt tolerance gene in wild soybean by whole-genome sequencing.
    Nat Commun, 2014. 5: p. 4340
    [PMID:25004933]